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Abstract 

Suppose that we sequentially put n balls into n bins. If we put each 
ball into a random bin then the heaviest bin will contain log n/ log log n 
balls (w.h.p.). However, Azar, Broder, Karlin and Upfal JLj showed 
that if for each ball we choose two bins at random and put it in the 
least loaded bin among the two then the heaviest bin will contain 
only log log n balls (w.h.p). How much memory do we need to im- 
plement this scheme? We need roughly log log log n bits per bin, and 
n log log log n bits in total. 

Let us assume now that we have limited amount of memory. For 
each ball, we are given two random bins and we have to put the ball 
into one of them. Our goal is to minimize the load of the heaviest 
bin. We prove that if we have bits then the heaviest bin will 

contain at least 17(5 log n/ log log n) balls. The bound is tight in the 
communication complexity model. 
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1 Introduction 



In this paper, we analyze the balanced allocation problem in the case of 
limited memory. Specifically, we prove the following theorem. In Section [Sj 
we show that the theorem is tight in the communication complexity model. 

Theorem 1.1. We are sequentially given n balls. We have to put each of 
them into one of two bins chosen uniformly at random among n bins. We 
have only M = n^^^ < n/logn bits of memory (6 may depend on n); our 
choice where to put a ball can depend only on these memory bits and random 
bits. Then the heaviest bin will contain 2io^°fo"n balls with probability 1 — o(l). 



Remark 1.1. One could assume that if the number of memory bits is o{n) 
then the heaviest bin should contain ~ , " balls. However, that is not the 

log log n ' 

case as the following example shows. Let us group all bins into n/ log log n 
clusters; each cluster consists o/ log log n bins. For each cluster, we keep the 
total number of balls in the bins that form the cluster. Now given a ball and 
two bins, we put the ball into the bin whose cluster contains fewer balls. The 
result of Azar, Broder, Karlin and Upfal JT^ implies that w.h.p. each cluster 
will contain at most n/\(^\ogn log n = 2 log log n balls. Therefore, each 
bin will also contain at most 2 log log n balls. Finally, to keep the number of 
balls in each cluster we need — r — log log log n = o(n) bits in total. 
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2 Proof of Theorem 1.1 



Consider one step of the process: we are given two bins chosen uniformly 
at random, and we put the ball into one of them. Let Pi = pf^^ be the 
probability that we put the ball into bin i given that the memory state is 
m G {1, . . . , 2^^}. Let F^ = F^ = {i : p^ < e/n]. 

Claim 2.1. 

1. Fore every set of bins S, the probability that we put a ball in a bin from 
S is at least e\S\ F^\/n. 
2. 

\m < en. 
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Proof. 1. The desired probability equals 

2. The probability that both chosen bins are in is \F^\'^/n'^. Therefore, 

the probability t that we put the ball into a bin from is at least \F^\'^/n'^. 
On the other hand, we have 



elF^ I 

^ I ml 



n 

We conclude that \F^\ < en. □ 

We divide the process into L consecutive phases. In each phase, we put 
\n/L\ balls into bins. Let Si be the set of bins that contain at least i balls 
at the end of the phase i; let 5*0 = {1, ... , n}. Now we will prove a bound on 
the size of Si that in turn will imply Theorem 11.11 

Lemma 2.2. Let L = ^ logn/ log log n, e = 1/(2L). For every i E {0, . . . , L}, 

let £i be the event that for every ki,. . . , ki-i: \Si\\j^ZlFkj \ > (5/(4L))*n/2. 
Then 

Pi{£i) = l-o{l). 

In particular, PrdS'^l > 0) > Pt{Sl) = 1 — o{l), and therefore, in the end, 
the heaviest bin contains at least L balls w.h.p. 

Proof. First, note that the event So always holds: 

> n — Leu = n/2. 

Now we shall prove that Pr(£^j|£^j„i) < o(l/L), and thus 



L 



Pt{S,) > 1 - J]Pr(^,|£:,_i) - Pr(^"o) = 1 - o(l). 
i=i 

Assume that Si_i holds. Fix fci, . . . , /c^.j. We are going to estimate the 
number of bins in Si^i \ IJj"=i -^fcj which we put a ball into during the phase 
i. All those bins belong to Si. 
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Consider one step of the process. Let be the set of bins in 5'j_i\|J^r^* F^. 
where we have aheady put a ball into. Let m be the state of the memory. 
Since £i-i holds, 



L-i 



m^i\[jF,^)\Fj> 



i=i 



By Claim 12. Il l, we put a ball into a bin from (5'i_i \ IJj=i -^fcj) \ ^ with 
probability at least 



n 



n 



That is, if |A^| < {e/ (4L))*~^n/4, we add a new bin to with probability at 
least (e:/(4L))*^^ x e/A. Therefore, by the Chernoff bound, the probability 
that the size of at the end of the phase is at least 

ix(./(4L))-x£x„/L=(£» 

is at least 1 - 2 • 2-V8(£/(4L))»n_ g-^^g g, y ji-i ^ (^^^ ^^le 
phase). 



(L-i 

for fixed fci, . . . , 

By the union bound: 



-n 



L-i 



Pr(for all fci, . . . , kL-^ : IJ ^'^.l ^ (^/(4^)) V2|^^-i) 



> 1 - 2 • (2*^)^2"^/^(^/('^^))'" > 1 - 2 • 2^^-(i/(8^'))''"/8 

r / r / 1 1 \ log Ji/ log log n 

nl-^f L-n'^f ) ' ^ / 

) V V\/2<51ogny ' 



> 1 - 2 ■ 2 



>l-ol- 



□ 
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3 Matching Upper Bound 



In this section, we will prove that our bound is tight in the communication 
complexity model. Specifically, we present an algorithm that gets M — 'n}~^ 
bits of advice before each ball is thrown, and ensures that the heaviest bin 
contains at most 0(5 log / log log n) balls w.h.p. 

Observe that no matter which of the two bins we choose at each step, the 
probability pi that we put the ball in a bin i is at most 2/n. Therefore, the 
probability that after n steps the total number of balls in the bin i exceeds 
T = is at most e-%{l + o(l)) = 0((f )^) = o{l/{nnogn)) (i.e. the 

probability that the Poisson r.v. with A = 2 exceeds T). Thus the number 
of bins that contain at least T balls is at most n^/(21ogn) w.h.p. Before 
each step, our algorithm receives the list L of such bins, and the number of 
balls in each of them. Now if one of the two randomly chosen bins belongs 
to L and the other does not, the algorithm puts the ball into the bin that is 
not in L; if both bins are in L, the algorithm puts the ball into the bin with 
fewer balls; finally, if both bins are not in L the algorithm puts the ball into 
an arbitrary bin. 

Let us estimate the load of the heaviest bin. By the definition, each bin 
that is not in L contains at most T balls. Now we bound the number of balls 
in bins that arc in L. We say that a ball is an "extra ball" if we put it into 
a bin that is in L (at the moment when we put the ball). Let us now count 
only extra balls. Note that every time we get a ball, we either "discard it" 
— put it into a bin that is not in L, and thus do not count it — or put into 
one of the two bins that contains fewer "extra balls" , i.e. we use the scheme 
of Azar, Broder, Karlin, and Upfal. Then their analysis shows that each bin 
contains at most log log n extra balls w.h.p. Therefore, each bin contains at 
most log^iog^ + log log n balls w.h.p. We established the following theorem. 

Theorem 3.1. There exists an algorithm that gets M — bits of advice 
before each step and uses no other memory, and ensures that the heaviest bin 
contains at most Qf /^"^" ) balls w.h.p. 

^ log log n' ^ 
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